Wordspotting using a predictive neural model for the telephone speech corpus
نویسندگان
چکیده
We describe a wordspotting algorithm based on a predictive neural model for a telephone speech corpus. Each keyword is modeled as a whole word. For keyword detection scoring we used a minimum accumulated prediction residual. We computed empirically a threshold value for rejecting non-keyword speech in place of building non-keyword models. We tested the algorithm with the TUBTEL telephone speech corpus and compared it with other algorithms like the standard DTW-based wordspotting algorithm and the twostage wordspotting algorithm based on a DTW and a multilayer perceptron.
منابع مشابه
Empirical comparison of two multilayer perceptron-based keyword speech recognition algorithms
In this paper, an empirical comparison of two multilayer perceptron (MLP)-based techniques for keyword speech recognition (wordspotting) is described. The techniques are the predictive neural model (PNM)-based wordspotting, in which the MLP is applied as a speech pattern predictor to compute a local distance between the acoustic vector and the phone model, and the hybrid HMM/MLP-based wordspott...
متن کاملImproving English Conversational Telephone Speech Recognition
The goal of this work is to build a state-of-the-art English conversational telephone speech recognition system. We investigated several techniques to improve acoustic modeling, namely speaker-dependent bottleneck features, deep Bidirectional Long Short-Term Memory (BLSTM) recurrent neural networks, data augmentation and score fusion of DNN and BLSTM models. Training set consisted of the 300 ho...
متن کاملAllophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملTelephone speech recognition using neural networks and hidden Markov models
The performance of well trained speech recognizers using high quality full bandwidth speech data is usually degraded when used in real world environments In particular telephone speech recognition is extremely di cult due to the limited bandwidth of transmission channels In this paper neural network based adaptation methods are applied to telephone speech recognition and a new unsupervised mode...
متن کاملRejection of the Feed-Flow Disturbances in a Multi-Component Distillation Column Using a Multiple Neural Network Model-Predictive Controller
This article deals with the issues associated with developing a new design methodology for the nonlinear model-predictive control (MPC) of a chemical plant. A combination of multiple neural networks is selected and used to model a nonlinear multi-input multi-output (MIMO) process with time delays. An optimization procedure for a neural MPC algorithm based on this model is then developed. T...
متن کامل